Overview
Brought to you by YData
Dataset statistics
| Number of variables | 21 |
|---|---|
| Number of observations | 536634 |
| Missing cells | 2471290 |
| Missing cells (%) | 21.9% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 82.4 MiB |
| Average record size in memory | 161.0 B |
Variable types
| Numeric | 17 |
|---|---|
| DateTime | 1 |
| Categorical | 2 |
| Boolean | 1 |
fuel_price is highly overall correlated with year | High correlation |
lag1_weekly_sales is highly overall correlated with rolling_avg_4weeks and 1 other fields | High correlation |
markdown1 is highly overall correlated with markdown4 and 1 other fields | High correlation |
markdown4 is highly overall correlated with markdown1 | High correlation |
markdown5 is highly overall correlated with markdown1 and 1 other fields | High correlation |
month is highly overall correlated with week | High correlation |
rolling_avg_4weeks is highly overall correlated with lag1_weekly_sales and 1 other fields | High correlation |
size is highly overall correlated with markdown5 and 1 other fields | High correlation |
store is highly overall correlated with type | High correlation |
type is highly overall correlated with size and 1 other fields | High correlation |
week is highly overall correlated with month | High correlation |
weekly_sales is highly overall correlated with lag1_weekly_sales and 1 other fields | High correlation |
year is highly overall correlated with fuel_price | High correlation |
isholiday is highly imbalanced (69.2%) | Imbalance |
temperature has 115064 (21.4%) missing values | Missing |
fuel_price has 115064 (21.4%) missing values | Missing |
markdown1 has 385953 (71.9%) missing values | Missing |
markdown2 has 425386 (79.3%) missing values | Missing |
markdown3 has 399543 (74.5%) missing values | Missing |
markdown4 has 401667 (74.8%) missing values | Missing |
markdown5 has 385202 (71.8%) missing values | Missing |
type has 115064 (21.4%) missing values | Missing |
size has 115064 (21.4%) missing values | Missing |
rolling_avg_4weeks has 9941 (1.9%) missing values | Missing |
weekly_sales has 116422 (21.7%) zeros | Zeros |
lag1_weekly_sales has 113228 (21.1%) zeros | Zeros |
rolling_avg_4weeks has 105744 (19.7%) zeros | Zeros |
Reproduction
| Analysis started | 2025-03-22 05:18:39.023961 |
|---|---|
| Analysis finished | 2025-03-22 05:20:16.418228 |
| Duration | 1 minute and 37.39 seconds |
| Software version | ydata-profiling vv4.15.1 |
| Download configuration | config.json |
Variables
store
Real number (ℝ)
High correlation 
| Distinct | 45 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 22.208621 |
| Minimum | 1 |
|---|---|
| Maximum | 45 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 11 |
| median | 22 |
| Q3 | 33 |
| 95-th percentile | 43 |
| Maximum | 45 |
| Range | 44 |
| Interquartile range (IQR) | 22 |
Descriptive statistics
| Standard deviation | 12.79058 |
|---|---|
| Coefficient of variation (CV) | 0.57592862 |
| Kurtosis | -1.1472033 |
| Mean | 22.208621 |
| Median Absolute Deviation (MAD) | 11 |
| Skewness | 0.077554812 |
| Sum | 11917901 |
| Variance | 163.59895 |
| Monotonicity | Increasing |
| Value | Count | Frequency (%) |
| 13 | 13310 | 2.5% |
| 10 | 13097 | 2.4% |
| 4 | 13075 | 2.4% |
| 2 | 13035 | 2.4% |
| 1 | 13027 | 2.4% |
| 24 | 13018 | 2.4% |
| 27 | 13016 | 2.4% |
| 6 | 12999 | 2.4% |
| 34 | 12991 | 2.4% |
| 20 | 12988 | 2.4% |
| Other values (35) | 406078 |
| Value | Count | Frequency (%) |
| 1 | 13027 | |
| 2 | 13035 | |
| 3 | 11509 | |
| 4 | 13075 | |
| 5 | 11446 | |
| 6 | 12999 | |
| 7 | 12431 | |
| 8 | 12594 | |
| 9 | 11302 | |
| 10 | 13097 |
| Value | Count | Frequency (%) |
| 45 | 12263 | |
| 44 | 9241 | |
| 43 | 8614 | |
| 42 | 8915 | |
| 41 | 12842 | |
| 40 | 12755 | |
| 39 | 12582 | |
| 38 | 9349 | |
| 37 | 9219 | |
| 36 | 7940 |
dept
Real number (ℝ)
| Distinct | 81 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 44.277301 |
| Minimum | 1 |
|---|---|
| Maximum | 99 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 18 |
| median | 37 |
| Q3 | 74 |
| 95-th percentile | 95 |
| Maximum | 99 |
| Range | 98 |
| Interquartile range (IQR) | 56 |
Descriptive statistics
| Standard deviation | 30.527358 |
|---|---|
| Coefficient of variation (CV) | 0.68945843 |
| Kurtosis | -1.2174062 |
| Mean | 44.277301 |
| Median Absolute Deviation (MAD) | 23 |
| Skewness | 0.35914961 |
| Sum | 23760705 |
| Variance | 931.9196 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 8190 | 1.5% |
| 16 | 8190 | 1.5% |
| 92 | 8190 | 1.5% |
| 38 | 8190 | 1.5% |
| 40 | 8190 | 1.5% |
| 2 | 8190 | 1.5% |
| 82 | 8190 | 1.5% |
| 46 | 8190 | 1.5% |
| 95 | 8190 | 1.5% |
| 81 | 8190 | 1.5% |
| Other values (71) | 454734 |
| Value | Count | Frequency (%) |
| 1 | 8190 | |
| 2 | 8190 | |
| 3 | 8190 | |
| 4 | 8190 | |
| 5 | 8085 | |
| 6 | 7563 | |
| 7 | 8190 | |
| 8 | 8190 | |
| 9 | 8108 | |
| 10 | 8190 |
| Value | Count | Frequency (%) |
| 99 | 1475 | 0.3% |
| 98 | 7468 | |
| 97 | 7994 | |
| 96 | 6204 | |
| 95 | 8190 | |
| 94 | 7149 | |
| 93 | 7551 | |
| 92 | 8190 | |
| 91 | 8190 | |
| 90 | 8190 |
date
Date
| Distinct | 182 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.1 MiB |
| Minimum | 2010-02-05 00:00:00 |
|---|---|
| Maximum | 2013-07-26 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
weekly_sales
Real number (ℝ)
High correlation  Zeros 
| Distinct | 358786 |
|---|---|
| Distinct (%) | 66.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12554.753 |
| Minimum | 0 |
|---|---|
| Maximum | 693099.36 |
| Zeros | 116422 |
| Zeros (%) | 21.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 49.8525 |
| median | 4118.755 |
| Q3 | 15497.417 |
| 95-th percentile | 54924.86 |
| Maximum | 693099.36 |
| Range | 693099.36 |
| Interquartile range (IQR) | 15447.565 |
Descriptive statistics
| Standard deviation | 21171.149 |
|---|---|
| Coefficient of variation (CV) | 1.6863055 |
| Kurtosis | 24.696187 |
| Mean | 12554.753 |
| Median Absolute Deviation (MAD) | 4118.755 |
| Skewness | 3.5610279 |
| Sum | 6.7373071 × 109 |
| Variance | 4.4821754 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 116422 | 21.7% |
| 10 | 353 | 0.1% |
| 5 | 289 | 0.1% |
| 20 | 232 | < 0.1% |
| 15 | 215 | < 0.1% |
| 12 | 175 | < 0.1% |
| 1 | 169 | < 0.1% |
| 10.47 | 167 | < 0.1% |
| 11.97 | 154 | < 0.1% |
| 2 | 148 | < 0.1% |
| Other values (358776) | 418310 |
| Value | Count | Frequency (%) |
| 0 | 116422 | |
| 0.01 | 33 | < 0.1% |
| 0.02 | 38 | < 0.1% |
| 0.03 | 13 | < 0.1% |
| 0.04 | 20 | < 0.1% |
| 0.05 | 9 | < 0.1% |
| 0.06 | 23 | < 0.1% |
| 0.07 | 12 | < 0.1% |
| 0.08 | 21 | < 0.1% |
| 0.09 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 693099.36 | 1 | |
| 649770.18 | 1 | |
| 630999.19 | 1 | |
| 627962.93 | 1 | |
| 474330.1 | 1 | |
| 422306.25 | 1 | |
| 420586.57 | 1 | |
| 406988.63 | 1 | |
| 404245.03 | 1 | |
| 393705.2 | 1 |
temperature
Real number (ℝ)
Missing 
| Distinct | 3528 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 115064 |
| Missing (%) | 21.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 60.090059 |
| Minimum | -2.06 |
|---|---|
| Maximum | 100.14 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 69 |
| Negative (%) | < 0.1% |
| Memory size | 4.1 MiB |
Quantile statistics
| Minimum | -2.06 |
|---|---|
| 5-th percentile | 27.31 |
| Q1 | 46.68 |
| median | 62.09 |
| Q3 | 74.28 |
| 95-th percentile | 87.27 |
| Maximum | 100.14 |
| Range | 102.2 |
| Interquartile range (IQR) | 27.6 |
Descriptive statistics
| Standard deviation | 18.447931 |
|---|---|
| Coefficient of variation (CV) | 0.30700471 |
| Kurtosis | -0.63592198 |
| Mean | 60.090059 |
| Median Absolute Deviation (MAD) | 13.63 |
| Skewness | -0.32140415 |
| Sum | 25332166 |
| Variance | 340.32616 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 50.43 | 709 | 0.1% |
| 67.87 | 646 | 0.1% |
| 72.62 | 594 | 0.1% |
| 76.67 | 583 | 0.1% |
| 70.28 | 563 | 0.1% |
| 76.03 | 555 | 0.1% |
| 50.56 | 544 | 0.1% |
| 64.05 | 542 | 0.1% |
| 64.21 | 519 | 0.1% |
| 50.81 | 487 | 0.1% |
| Other values (3518) | 415828 | |
| (Missing) | 115064 | 21.4% |
| Value | Count | Frequency (%) |
| -2.06 | 69 | |
| 5.54 | 68 | |
| 6.23 | 69 | |
| 7.46 | 69 | |
| 9.51 | 70 | |
| 9.55 | 69 | |
| 10.09 | 66 | |
| 10.11 | 68 | |
| 10.24 | 69 | |
| 10.53 | 72 |
| Value | Count | Frequency (%) |
| 100.14 | 44 | < 0.1% |
| 100.07 | 46 | < 0.1% |
| 99.66 | 48 | < 0.1% |
| 99.22 | 185 | |
| 99.2 | 46 | < 0.1% |
| 98.43 | 43 | < 0.1% |
| 98.15 | 47 | < 0.1% |
| 97.66 | 42 | < 0.1% |
| 97.6 | 48 | < 0.1% |
| 97.18 | 187 |
fuel_price
Real number (ℝ)
High correlation  Missing 
| Distinct | 892 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 115064 |
| Missing (%) | 21.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.3610265 |
| Minimum | 2.472 |
|---|---|
| Maximum | 4.468 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.1 MiB |
Quantile statistics
| Minimum | 2.472 |
|---|---|
| 5-th percentile | 2.653 |
| Q1 | 2.933 |
| median | 3.452 |
| Q3 | 3.738 |
| 95-th percentile | 4.029 |
| Maximum | 4.468 |
| Range | 1.996 |
| Interquartile range (IQR) | 0.805 |
Descriptive statistics
| Standard deviation | 0.45851454 |
|---|---|
| Coefficient of variation (CV) | 0.13642098 |
| Kurtosis | -1.1854045 |
| Mean | 3.3610265 |
| Median Absolute Deviation (MAD) | 0.375 |
| Skewness | -0.1049015 |
| Sum | 1416908 |
| Variance | 0.21023558 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3.638 | 2548 | 0.5% |
| 3.63 | 2164 | 0.4% |
| 2.771 | 1917 | 0.4% |
| 3.891 | 1856 | 0.3% |
| 3.594 | 1796 | 0.3% |
| 3.524 | 1793 | 0.3% |
| 3.523 | 1792 | 0.3% |
| 2.72 | 1790 | 0.3% |
| 3.666 | 1778 | 0.3% |
| 2.78 | 1656 | 0.3% |
| Other values (882) | 402480 | |
| (Missing) | 115064 | 21.4% |
| Value | Count | Frequency (%) |
| 2.472 | 38 | < 0.1% |
| 2.513 | 45 | < 0.1% |
| 2.514 | 906 | |
| 2.52 | 39 | < 0.1% |
| 2.533 | 42 | < 0.1% |
| 2.539 | 37 | < 0.1% |
| 2.54 | 147 | < 0.1% |
| 2.542 | 45 | < 0.1% |
| 2.545 | 38 | < 0.1% |
| 2.548 | 902 |
| Value | Count | Frequency (%) |
| 4.468 | 368 | |
| 4.449 | 358 | |
| 4.308 | 168 | |
| 4.301 | 360 | |
| 4.294 | 363 | |
| 4.293 | 192 | |
| 4.288 | 172 | |
| 4.282 | 173 | |
| 4.277 | 357 | |
| 4.273 | 366 |
markdown1
Real number (ℝ)
High correlation  Missing 
| Distinct | 2277 |
|---|---|
| Distinct (%) | 1.5% |
| Missing | 385953 |
| Missing (%) | 71.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7246.4202 |
| Minimum | 0.27 |
|---|---|
| Maximum | 88646.76 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.1 MiB |
Quantile statistics
| Minimum | 0.27 |
|---|---|
| 5-th percentile | 149.19 |
| Q1 | 2240.27 |
| median | 5347.45 |
| Q3 | 9210.9 |
| 95-th percentile | 21801.35 |
| Maximum | 88646.76 |
| Range | 88646.49 |
| Interquartile range (IQR) | 6970.63 |
Descriptive statistics
| Standard deviation | 8291.2213 |
|---|---|
| Coefficient of variation (CV) | 1.1441817 |
| Kurtosis | 17.606263 |
| Mean | 7246.4202 |
| Median Absolute Deviation (MAD) | 3430.74 |
| Skewness | 3.3418447 |
| Sum | 1.0918978 × 109 |
| Variance | 68744351 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1.5 | 102 | < 0.1% |
| 460.73 | 102 | < 0.1% |
| 175.64 | 93 | < 0.1% |
| 1282.42 | 75 | < 0.1% |
| 9264.48 | 75 | < 0.1% |
| 686.24 | 75 | < 0.1% |
| 5924.71 | 75 | < 0.1% |
| 1483.17 | 75 | < 0.1% |
| 3124.45 | 74 | < 0.1% |
| 6809.96 | 74 | < 0.1% |
| Other values (2267) | 149861 | 27.9% |
| (Missing) | 385953 |
| Value | Count | Frequency (%) |
| 0.27 | 51 | |
| 0.5 | 49 | |
| 1.5 | 102 | |
| 1.94 | 50 | |
| 2.12 | 52 | |
| 2.4 | 49 | |
| 2.42 | 50 | |
| 2.43 | 51 | |
| 2.8 | 50 | |
| 2.91 | 51 |
| Value | Count | Frequency (%) |
| 88646.76 | 68 | |
| 78124.5 | 70 | |
| 75149.79 | 73 | |
| 65021.23 | 73 | |
| 62567.6 | 66 | |
| 62172.73 | 72 | |
| 60740.64 | 70 | |
| 60394.73 | 72 | |
| 58928.52 | 72 | |
| 56917.7 | 71 |
markdown2
Real number (ℝ)
Missing 
| Distinct | 1499 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 425386 |
| Missing (%) | 79.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3334.6286 |
| Minimum | -265.76 |
|---|---|
| Maximum | 104519.54 |
| Zeros | 207 |
| Zeros (%) | < 0.1% |
| Negative | 1311 |
| Negative (%) | 0.2% |
| Memory size | 4.1 MiB |
Quantile statistics
| Minimum | -265.76 |
|---|---|
| 5-th percentile | 1.95 |
| Q1 | 41.6 |
| median | 192 |
| Q3 | 1926.94 |
| 95-th percentile | 16497.47 |
| Maximum | 104519.54 |
| Range | 104785.3 |
| Interquartile range (IQR) | 1885.34 |
Descriptive statistics
| Standard deviation | 9475.3573 |
|---|---|
| Coefficient of variation (CV) | 2.841503 |
| Kurtosis | 37.589561 |
| Mean | 3334.6286 |
| Median Absolute Deviation (MAD) | 184.73 |
| Skewness | 5.4412612 |
| Sum | 3.7097076 × 108 |
| Variance | 89782396 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1.91 | 539 | 0.1% |
| 3 | 493 | 0.1% |
| 0.5 | 485 | 0.1% |
| 1.5 | 471 | 0.1% |
| 4 | 367 | 0.1% |
| 6 | 365 | 0.1% |
| 7.64 | 354 | 0.1% |
| 3.82 | 353 | 0.1% |
| 19 | 345 | 0.1% |
| 5.73 | 345 | 0.1% |
| Other values (1489) | 107131 | 20.0% |
| (Missing) | 425386 |
| Value | Count | Frequency (%) |
| -265.76 | 71 | |
| -192 | 72 | |
| -20 | 72 | |
| -10.98 | 60 | |
| -10.5 | 143 | |
| -9.98 | 68 | |
| -9.94 | 62 | |
| -7.6 | 69 | |
| -7.01 | 69 | |
| -6.69 | 69 |
| Value | Count | Frequency (%) |
| 104519.54 | 72 | |
| 97740.99 | 73 | |
| 92523.94 | 73 | |
| 89121.94 | 74 | |
| 82881.16 | 73 | |
| 72413.71 | 72 | |
| 70574.85 | 71 | |
| 58804.91 | 69 | |
| 58046.41 | 71 | |
| 56106.2 | 72 |
markdown3
Real number (ℝ)
Missing 
| Distinct | 1662 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 399543 |
| Missing (%) | 74.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1439.4214 |
| Minimum | -29.1 |
|---|---|
| Maximum | 141630.61 |
| Zeros | 67 |
| Zeros (%) | < 0.1% |
| Negative | 257 |
| Negative (%) | < 0.1% |
| Memory size | 4.1 MiB |
Quantile statistics
| Minimum | -29.1 |
|---|---|
| 5-th percentile | 0.65 |
| Q1 | 5.08 |
| median | 24.6 |
| Q3 | 103.99 |
| 95-th percentile | 1059.9 |
| Maximum | 141630.61 |
| Range | 141659.71 |
| Interquartile range (IQR) | 98.91 |
Descriptive statistics
| Standard deviation | 9623.0783 |
|---|---|
| Coefficient of variation (CV) | 6.6853796 |
| Kurtosis | 77.687772 |
| Mean | 1439.4214 |
| Median Absolute Deviation (MAD) | 22.6 |
| Skewness | 8.399453 |
| Sum | 1.9733172 × 108 |
| Variance | 92603636 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 754 | 0.1% |
| 6 | 710 | 0.1% |
| 2 | 660 | 0.1% |
| 1 | 611 | 0.1% |
| 0.22 | 487 | 0.1% |
| 0.5 | 463 | 0.1% |
| 0.01 | 444 | 0.1% |
| 4 | 439 | 0.1% |
| 3.2 | 379 | 0.1% |
| 1.98 | 363 | 0.1% |
| Other values (1652) | 131781 | 24.6% |
| (Missing) | 399543 |
| Value | Count | Frequency (%) |
| -29.1 | 72 | < 0.1% |
| -1 | 70 | < 0.1% |
| -0.87 | 46 | < 0.1% |
| -0.2 | 69 | < 0.1% |
| 0 | 67 | < 0.1% |
| 0.01 | 444 | |
| 0.02 | 124 | < 0.1% |
| 0.04 | 241 | |
| 0.05 | 71 | < 0.1% |
| 0.06 | 205 |
| Value | Count | Frequency (%) |
| 141630.61 | 74 | |
| 109030.75 | 75 | |
| 103991.94 | 72 | |
| 101378.79 | 73 | |
| 89402.64 | 71 | |
| 88805.58 | 72 | |
| 83340.33 | 74 | |
| 83192.81 | 74 | |
| 79621.2 | 72 | |
| 77451.26 | 73 |
markdown4
Real number (ℝ)
High correlation  Missing 
| Distinct | 1944 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 401667 |
| Missing (%) | 74.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3383.1683 |
| Minimum | 0.22 |
|---|---|
| Maximum | 67474.85 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.1 MiB |
Quantile statistics
| Minimum | 0.22 |
|---|---|
| 5-th percentile | 28.76 |
| Q1 | 504.22 |
| median | 1481.31 |
| Q3 | 3595.04 |
| 95-th percentile | 12645.96 |
| Maximum | 67474.85 |
| Range | 67474.63 |
| Interquartile range (IQR) | 3090.82 |
Descriptive statistics
| Standard deviation | 6292.384 |
|---|---|
| Coefficient of variation (CV) | 1.8599087 |
| Kurtosis | 29.996815 |
| Mean | 3383.1683 |
| Median Absolute Deviation (MAD) | 1167.55 |
| Skewness | 4.8475 |
| Sum | 4.5661607 × 108 |
| Variance | 39594097 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 9 | 280 | 0.1% |
| 4 | 200 | < 0.1% |
| 2 | 197 | < 0.1% |
| 3 | 146 | < 0.1% |
| 47 | 143 | < 0.1% |
| 67.72 | 142 | < 0.1% |
| 657.56 | 141 | < 0.1% |
| 17 | 141 | < 0.1% |
| 8 | 140 | < 0.1% |
| 1330.36 | 140 | < 0.1% |
| Other values (1934) | 133297 | 24.8% |
| (Missing) | 401667 |
| Value | Count | Frequency (%) |
| 0.22 | 57 | < 0.1% |
| 0.41 | 52 | < 0.1% |
| 0.46 | 48 | < 0.1% |
| 0.78 | 52 | < 0.1% |
| 0.87 | 49 | < 0.1% |
| 0.92 | 45 | < 0.1% |
| 1.5 | 55 | < 0.1% |
| 1.88 | 48 | < 0.1% |
| 1.98 | 44 | < 0.1% |
| 2 | 197 |
| Value | Count | Frequency (%) |
| 67474.85 | 72 | |
| 57817.56 | 74 | |
| 57815.43 | 68 | |
| 53603.99 | 72 | |
| 52739.02 | 72 | |
| 48403.53 | 70 | |
| 48159.86 | 73 | |
| 48086.64 | 72 | |
| 47452.43 | 73 | |
| 46238.28 | 71 |
markdown5
Real number (ℝ)
High correlation  Missing 
| Distinct | 2293 |
|---|---|
| Distinct (%) | 1.5% |
| Missing | 385202 |
| Missing (%) | 71.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4628.9751 |
| Minimum | 135.16 |
|---|---|
| Maximum | 108519.28 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.1 MiB |
Quantile statistics
| Minimum | 135.16 |
|---|---|
| 5-th percentile | 715.52 |
| Q1 | 1878.44 |
| median | 3359.45 |
| Q3 | 5563.8 |
| 95-th percentile | 11269.24 |
| Maximum | 108519.28 |
| Range | 108384.12 |
| Interquartile range (IQR) | 3685.36 |
Descriptive statistics
| Standard deviation | 5962.8875 |
|---|---|
| Coefficient of variation (CV) | 1.2881658 |
| Kurtosis | 107.84927 |
| Mean | 4628.9751 |
| Median Absolute Deviation (MAD) | 1702.47 |
| Skewness | 8.1699095 |
| Sum | 7.0097495 × 108 |
| Variance | 35556027 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2743.18 | 136 | < 0.1% |
| 1064.56 | 120 | < 0.1% |
| 9083.54 | 75 | < 0.1% |
| 3567.03 | 75 | < 0.1% |
| 3557.67 | 75 | < 0.1% |
| 20371.02 | 75 | < 0.1% |
| 4180.29 | 75 | < 0.1% |
| 1773.53 | 74 | < 0.1% |
| 3932.94 | 74 | < 0.1% |
| 4464.45 | 74 | < 0.1% |
| Other values (2283) | 150579 | 28.1% |
| (Missing) | 385202 |
| Value | Count | Frequency (%) |
| 135.16 | 65 | |
| 153.04 | 47 | |
| 153.9 | 49 | |
| 164.08 | 52 | |
| 170.64 | 69 | |
| 171.76 | 71 | |
| 180.07 | 64 | |
| 212.75 | 50 | |
| 224.86 | 50 | |
| 227.12 | 48 |
| Value | Count | Frequency (%) |
| 108519.28 | 68 | |
| 105223.11 | 70 | |
| 85851.87 | 68 | |
| 63005.58 | 69 | |
| 58068.14 | 69 | |
| 57029.78 | 68 | |
| 53212.72 | 70 | |
| 37581.27 | 70 | |
| 36430.33 | 71 | |
| 36360.42 | 72 |
cpi
Real number (ℝ)
| Distinct | 2145 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 172.29408 |
| Minimum | 126.064 |
|---|---|
| Maximum | 227.23281 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.1 MiB |
Quantile statistics
| Minimum | 126.064 |
|---|---|
| 5-th percentile | 126.60506 |
| Q1 | 132.15213 |
| median | 182.53209 |
| Q3 | 214.19216 |
| 95-th percentile | 223.71143 |
| Maximum | 227.23281 |
| Range | 101.16881 |
| Interquartile range (IQR) | 82.040028 |
Descriptive statistics
| Standard deviation | 39.624185 |
|---|---|
| Coefficient of variation (CV) | 0.22997995 |
| Kurtosis | -1.8228877 |
| Mean | 172.29408 |
| Median Absolute Deviation (MAD) | 42.097336 |
| Skewness | 0.088032193 |
| Sum | 92458864 |
| Variance | 1570.076 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 131.1930968 | 27712 | 5.2% |
| 138.7281613 | 22235 | 4.1% |
| 199.2195317 | 11075 | 2.1% |
| 223.0783366 | 10381 | 1.9% |
| 142.7624113 | 8191 | 1.5% |
| 222.1136566 | 6600 | 1.2% |
| 216.1515902 | 5597 | 1.0% |
| 192.3088989 | 5467 | 1.0% |
| 226.9873637 | 5363 | 1.0% |
| 225.0686254 | 2860 | 0.5% |
| Other values (2135) | 431153 |
| Value | Count | Frequency (%) |
| 126.064 | 678 | |
| 126.0766452 | 679 | |
| 126.0854516 | 675 | |
| 126.0892903 | 682 | |
| 126.1019355 | 686 | |
| 126.1069032 | 681 | |
| 126.1119032 | 682 | |
| 126.114 | 687 | |
| 126.1145806 | 689 | |
| 126.1266 | 683 |
| Value | Count | Frequency (%) |
| 227.2328068 | 2498 | |
| 227.214288 | 62 | < 0.1% |
| 227.1693919 | 63 | < 0.1% |
| 227.0369359 | 2769 | |
| 227.0184166 | 69 | < 0.1% |
| 226.9873637 | 5363 | |
| 226.9735448 | 69 | < 0.1% |
| 226.9688442 | 134 | < 0.1% |
| 226.9662325 | 63 | < 0.1% |
| 226.9239785 | 135 | < 0.1% |
unemployment
Real number (ℝ)
| Distinct | 349 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.7427396 |
| Minimum | 3.879 |
|---|---|
| Maximum | 14.313 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.1 MiB |
Quantile statistics
| Minimum | 3.879 |
|---|---|
| 5-th percentile | 5.114 |
| Q1 | 6.565 |
| median | 7.725 |
| Q3 | 8.549 |
| 95-th percentile | 10.641 |
| Maximum | 14.313 |
| Range | 10.434 |
| Interquartile range (IQR) | 1.984 |
Descriptive statistics
| Standard deviation | 1.8561704 |
|---|---|
| Coefficient of variation (CV) | 0.23973045 |
| Kurtosis | 2.4980059 |
| Mean | 7.7427396 |
| Median Absolute Deviation (MAD) | 0.942 |
| Skewness | 1.0252271 |
| Sum | 4155017.3 |
| Variance | 3.4453687 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 6.17 | 11159 | 2.1% |
| 10.199 | 8125 | 1.5% |
| 6.228 | 7101 | 1.3% |
| 7.992 | 6061 | 1.1% |
| 7.293 | 6016 | 1.1% |
| 4.145 | 6003 | 1.1% |
| 7.557 | 5987 | 1.1% |
| 8.839 | 5971 | 1.1% |
| 8.667 | 5877 | 1.1% |
| 6.034 | 5767 | 1.1% |
| Other values (339) | 468567 |
| Value | Count | Frequency (%) |
| 3.879 | 3090 | |
| 4.077 | 938 | 0.2% |
| 4.125 | 1831 | 0.3% |
| 4.145 | 6003 | |
| 4.156 | 1815 | 0.3% |
| 4.261 | 1829 | 0.3% |
| 4.308 | 935 | 0.2% |
| 4.42 | 1855 | 0.3% |
| 4.584 | 1988 | 0.4% |
| 4.607 | 935 | 0.2% |
| Value | Count | Frequency (%) |
| 14.313 | 2636 | |
| 14.18 | 2423 | |
| 14.099 | 2441 | |
| 14.021 | 2263 | |
| 13.975 | 1529 | |
| 13.736 | 2464 | |
| 13.503 | 2661 | |
| 12.89 | 2491 | |
| 12.187 | 2507 | |
| 11.627 | 2502 |
type
Categorical
High correlation  Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 115064 |
| Missing (%) | 21.4% |
| Memory size | 4.1 MiB |
| A | |
|---|---|
| B | |
| C |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | A |
|---|---|
| 2nd row | A |
| 3rd row | A |
| 4th row | A |
| 5th row | A |
Common Values
| Value | Count | Frequency (%) |
| A | 215478 | |
| B | 163495 | |
| C | 42597 | 7.9% |
| (Missing) | 115064 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| a | 215478 | |
| b | 163495 | |
| c | 42597 | 10.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 215478 | |
| B | 163495 | |
| C | 42597 | 10.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 421570 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| A | 215478 | |
| B | 163495 | |
| C | 42597 | 10.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 421570 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| A | 215478 | |
| B | 163495 | |
| C | 42597 | 10.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 421570 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| A | 215478 | |
| B | 163495 | |
| C | 42597 | 10.1% |
size
Real number (ℝ)
High correlation  Missing 
| Distinct | 40 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 115064 |
| Missing (%) | 21.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 136727.92 |
| Minimum | 34875 |
|---|---|
| Maximum | 219622 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.1 MiB |
Quantile statistics
| Minimum | 34875 |
|---|---|
| 5-th percentile | 39690 |
| Q1 | 93638 |
| median | 140167 |
| Q3 | 202505 |
| 95-th percentile | 206302 |
| Maximum | 219622 |
| Range | 184747 |
| Interquartile range (IQR) | 108867 |
Descriptive statistics
| Standard deviation | 60980.583 |
|---|---|
| Coefficient of variation (CV) | 0.44599951 |
| Kurtosis | -1.2063459 |
| Mean | 136727.92 |
| Median Absolute Deviation (MAD) | 62140 |
| Skewness | -0.32584977 |
| Sum | 5.7640387 × 1010 |
| Variance | 3.7186315 × 109 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 39690 | 20802 | 3.9% |
| 39910 | 20597 | 3.8% |
| 203819 | 20376 | 3.8% |
| 219622 | 10474 | 2.0% |
| 126512 | 10315 | 1.9% |
| 205863 | 10272 | 1.9% |
| 151315 | 10244 | 1.9% |
| 202307 | 10238 | 1.9% |
| 204184 | 10225 | 1.9% |
| 158114 | 10224 | 1.9% |
| Other values (30) | 287803 | |
| (Missing) | 115064 | 21.4% |
| Value | Count | Frequency (%) |
| 34875 | 8999 | |
| 37392 | 9036 | |
| 39690 | 20802 | |
| 39910 | 20597 | |
| 41062 | 6751 | 1.3% |
| 42988 | 7156 | 1.3% |
| 57197 | 9443 | |
| 70713 | 9762 | |
| 93188 | 9864 | |
| 93638 | 9455 |
| Value | Count | Frequency (%) |
| 219622 | 10474 | |
| 207499 | 10062 | |
| 206302 | 10113 | |
| 205863 | 10272 | |
| 204184 | 10225 | |
| 203819 | 20376 | |
| 203750 | 10142 | |
| 203742 | 10214 | |
| 203007 | 10202 | |
| 202505 | 10211 |
isholiday
Boolean
Imbalance 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 524.2 KiB |
| False | |
|---|---|
| True | 29661 |
| Value | Count | Frequency (%) |
| False | 506973 | |
| True | 29661 | 5.5% |
year
Categorical
High correlation 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.1 MiB |
| 2012 | |
|---|---|
| 2011 | |
| 2010 | |
| 2013 |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2010 |
|---|---|
| 2nd row | 2010 |
| 3rd row | 2010 |
| 4th row | 2010 |
| 5th row | 2010 |
Common Values
| Value | Count | Frequency (%) |
| 2012 | 154227 | |
| 2011 | 153453 | |
| 2010 | 140679 | |
| 2013 | 88275 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 2012 | 154227 | |
| 2011 | 153453 | |
| 2010 | 140679 | |
| 2013 | 88275 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 690861 | |
| 1 | 690087 | |
| 0 | 677313 | |
| 3 | 88275 | 4.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2146536 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 2 | 690861 | |
| 1 | 690087 | |
| 0 | 677313 | |
| 3 | 88275 | 4.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2146536 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 2 | 690861 | |
| 1 | 690087 | |
| 0 | 677313 | |
| 3 | 88275 | 4.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2146536 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 2 | 690861 | |
| 1 | 690087 | |
| 0 | 677313 | |
| 3 | 88275 | 4.1% |
month
Real number (ℝ)
High correlation 
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.2952031 |
| Minimum | 1 |
|---|---|
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 6 |
| Q3 | 9 |
| 95-th percentile | 12 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 3.333808 |
|---|---|
| Coefficient of variation (CV) | 0.5295791 |
| Kurtosis | -1.1222048 |
| Mean | 6.2952031 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 0.15184977 |
| Sum | 3378220 |
| Variance | 11.114276 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 53128 | |
| 4 | 53119 | |
| 7 | 52712 | |
| 5 | 50040 | |
| 6 | 49841 | |
| 2 | 47376 | |
| 12 | 41767 | |
| 11 | 38437 | |
| 10 | 38362 | |
| 9 | 38339 | |
| Other values (2) | 73513 |
| Value | Count | Frequency (%) |
| 1 | 35344 | |
| 2 | 47376 | |
| 3 | 53128 | |
| 4 | 53119 | |
| 5 | 50040 | |
| 6 | 49841 | |
| 7 | 52712 | |
| 8 | 38169 | |
| 9 | 38339 | |
| 10 | 38362 |
| Value | Count | Frequency (%) |
| 12 | 41767 | |
| 11 | 38437 | |
| 10 | 38362 | |
| 9 | 38339 | |
| 8 | 38169 | |
| 7 | 52712 | |
| 6 | 49841 | |
| 5 | 50040 | |
| 4 | 53119 | |
| 3 | 53128 |
week
Real number (ℝ)
High correlation 
| Distinct | 52 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 25.231581 |
| Minimum | 1 |
|---|---|
| Maximum | 52 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 13 |
| median | 24 |
| Q3 | 37 |
| 95-th percentile | 50 |
| Maximum | 52 |
| Range | 51 |
| Interquartile range (IQR) | 24 |
Descriptive statistics
| Standard deviation | 14.554119 |
|---|---|
| Coefficient of variation (CV) | 0.57682154 |
| Kurtosis | -1.1108094 |
| Mean | 25.231581 |
| Median Absolute Deviation (MAD) | 12 |
| Skewness | 0.161594 |
| Sum | 13540124 |
| Variance | 211.82238 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 7 | 11913 | 2.2% |
| 6 | 11859 | 2.2% |
| 9 | 11830 | 2.2% |
| 10 | 11830 | 2.2% |
| 15 | 11821 | 2.2% |
| 19 | 11816 | 2.2% |
| 14 | 11815 | 2.2% |
| 16 | 11811 | 2.2% |
| 5 | 11809 | 2.2% |
| 18 | 11806 | 2.2% |
| Other values (42) | 418324 |
| Value | Count | Frequency (%) |
| 1 | 8867 | |
| 2 | 8838 | |
| 3 | 8827 | |
| 4 | 8812 | |
| 5 | 11809 | |
| 6 | 11859 | |
| 7 | 11913 | |
| 8 | 11795 | |
| 9 | 11830 | |
| 10 | 11830 |
| Value | Count | Frequency (%) |
| 52 | 8934 | |
| 51 | 8985 | |
| 50 | 8958 | |
| 49 | 8947 | |
| 48 | 8905 | |
| 47 | 8935 | |
| 46 | 8833 | |
| 45 | 8864 | |
| 44 | 8843 | |
| 43 | 8855 |
lag1_weekly_sales
Real number (ℝ)
High correlation  Zeros 
| Distinct | 358786 |
|---|---|
| Distinct (%) | 67.3% |
| Missing | 3342 |
| Missing (%) | 0.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12633.406 |
| Minimum | 0 |
|---|---|
| Maximum | 693099.36 |
| Zeros | 113228 |
| Zeros (%) | 21.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 62.1125 |
| median | 4201.51 |
| Q3 | 15615.45 |
| 95-th percentile | 55074.468 |
| Maximum | 693099.36 |
| Range | 693099.36 |
| Interquartile range (IQR) | 15553.338 |
Descriptive statistics
| Standard deviation | 21213.982 |
|---|---|
| Coefficient of variation (CV) | 1.6791973 |
| Kurtosis | 24.591821 |
| Mean | 12633.406 |
| Median Absolute Deviation (MAD) | 4201.51 |
| Skewness | 3.5518573 |
| Sum | 6.7372944 × 109 |
| Variance | 4.5003303 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 113228 | 21.1% |
| 10 | 349 | 0.1% |
| 5 | 283 | 0.1% |
| 20 | 231 | < 0.1% |
| 15 | 212 | < 0.1% |
| 12 | 168 | < 0.1% |
| 10.47 | 167 | < 0.1% |
| 1 | 162 | < 0.1% |
| 11.97 | 151 | < 0.1% |
| 2 | 146 | < 0.1% |
| Other values (358776) | 418195 | |
| (Missing) | 3342 | 0.6% |
| Value | Count | Frequency (%) |
| 0 | 113228 | |
| 0.01 | 33 | < 0.1% |
| 0.02 | 38 | < 0.1% |
| 0.03 | 13 | < 0.1% |
| 0.04 | 20 | < 0.1% |
| 0.05 | 9 | < 0.1% |
| 0.06 | 23 | < 0.1% |
| 0.07 | 12 | < 0.1% |
| 0.08 | 21 | < 0.1% |
| 0.09 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 693099.36 | 1 | |
| 649770.18 | 1 | |
| 630999.19 | 1 | |
| 627962.93 | 1 | |
| 474330.1 | 1 | |
| 422306.25 | 1 | |
| 420586.57 | 1 | |
| 406988.63 | 1 | |
| 404245.03 | 1 | |
| 393705.2 | 1 |
rolling_avg_4weeks
Real number (ℝ)
High correlation  Missing  Zeros 
| Distinct | 399926 |
|---|---|
| Distinct (%) | 75.9% |
| Missing | 9941 |
| Missing (%) | 1.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12651.872 |
| Minimum | 0 |
|---|---|
| Maximum | 339472.76 |
| Zeros | 105744 |
| Zeros (%) | 19.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 105.0875 |
| median | 4347.8375 |
| Q3 | 15789.12 |
| 95-th percentile | 54730.856 |
| Maximum | 339472.76 |
| Range | 339472.76 |
| Interquartile range (IQR) | 15684.033 |
Descriptive statistics
| Standard deviation | 20735.369 |
|---|---|
| Coefficient of variation (CV) | 1.6389171 |
| Kurtosis | 13.478833 |
| Mean | 12651.872 |
| Median Absolute Deviation (MAD) | 4347.8375 |
| Skewness | 3.1006095 |
| Sum | 6.6636522 × 109 |
| Variance | 4.2995553 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 105744 | 19.7% |
| 3.75 | 46 | < 0.1% |
| 15 | 36 | < 0.1% |
| 17.5 | 32 | < 0.1% |
| 10 | 30 | < 0.1% |
| 12.5 | 28 | < 0.1% |
| 5 | 28 | < 0.1% |
| 8.75 | 28 | < 0.1% |
| 96.8 | 27 | < 0.1% |
| 99 | 27 | < 0.1% |
| Other values (399916) | 420667 | |
| (Missing) | 9941 | 1.9% |
| Value | Count | Frequency (%) |
| 0 | 105744 | |
| 0.0025 | 1 | < 0.1% |
| 0.005 | 1 | < 0.1% |
| 0.0075 | 1 | < 0.1% |
| 0.01 | 2 | < 0.1% |
| 0.0125 | 1 | < 0.1% |
| 0.015 | 2 | < 0.1% |
| 0.0175 | 1 | < 0.1% |
| 0.02 | 3 | < 0.1% |
| 0.0225 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 339472.7575 | 1 | |
| 304916.97 | 1 | |
| 301363.6975 | 1 | |
| 276995.57 | 1 | |
| 275834.4775 | 1 | |
| 275824.745 | 1 | |
| 273813.13 | 1 | |
| 272014.385 | 1 | |
| 267259.175 | 1 | |
| 249502.33 | 1 |
Interactions
Correlations
| cpi | dept | fuel_price | isholiday | lag1_weekly_sales | markdown1 | markdown2 | markdown3 | markdown4 | markdown5 | month | rolling_avg_4weeks | size | store | temperature | type | unemployment | week | weekly_sales | year | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| cpi | 1.000 | -0.009 | -0.041 | 0.034 | -0.123 | -0.017 | -0.099 | -0.111 | -0.063 | 0.021 | -0.003 | -0.121 | -0.005 | -0.227 | 0.173 | 0.183 | -0.391 | 0.001 | -0.125 | 0.299 |
| dept | -0.009 | 1.000 | 0.003 | 0.000 | -0.009 | 0.002 | 0.003 | 0.006 | 0.007 | 0.006 | 0.001 | -0.010 | 0.011 | 0.014 | 0.001 | 0.080 | 0.005 | 0.001 | -0.009 | 0.012 |
| fuel_price | -0.041 | 0.003 | 1.000 | 0.136 | 0.000 | 0.163 | -0.155 | -0.218 | 0.073 | -0.088 | -0.046 | -0.002 | 0.004 | 0.074 | 0.128 | 0.088 | -0.060 | -0.035 | 0.002 | 0.652 |
| isholiday | 0.034 | 0.000 | 0.136 | 1.000 | 0.037 | 0.057 | 0.359 | 0.458 | 0.115 | 0.060 | 0.334 | 0.031 | 0.000 | 0.000 | 0.186 | 0.000 | 0.054 | 0.384 | 0.035 | 0.134 |
| lag1_weekly_sales | -0.123 | -0.009 | 0.000 | 0.037 | 1.000 | 0.191 | 0.056 | 0.127 | 0.105 | 0.208 | 0.100 | 0.986 | 0.290 | -0.065 | -0.021 | 0.086 | 0.145 | 0.092 | 0.983 | 0.046 |
| markdown1 | -0.017 | 0.002 | 0.163 | 0.057 | 0.191 | 1.000 | 0.206 | 0.154 | 0.759 | 0.508 | -0.169 | 0.190 | 0.499 | -0.212 | 0.002 | 0.172 | 0.064 | -0.189 | 0.192 | 0.144 |
| markdown2 | -0.099 | 0.003 | -0.155 | 0.359 | 0.056 | 0.206 | 1.000 | 0.066 | 0.116 | 0.152 | -0.431 | 0.057 | 0.149 | 0.009 | -0.462 | 0.066 | 0.060 | -0.434 | 0.032 | 0.259 |
| markdown3 | -0.111 | 0.006 | -0.218 | 0.458 | 0.127 | 0.154 | 0.066 | 1.000 | 0.002 | 0.244 | 0.287 | 0.131 | 0.300 | -0.065 | -0.257 | 0.065 | 0.043 | 0.274 | 0.135 | 0.323 |
| markdown4 | -0.063 | 0.007 | 0.073 | 0.115 | 0.105 | 0.759 | 0.116 | 0.002 | 1.000 | 0.380 | -0.104 | 0.101 | 0.288 | -0.039 | 0.141 | 0.066 | 0.038 | -0.132 | 0.112 | 0.132 |
| markdown5 | 0.021 | 0.006 | -0.088 | 0.060 | 0.208 | 0.508 | 0.152 | 0.244 | 0.380 | 1.000 | 0.048 | 0.210 | 0.579 | -0.156 | -0.071 | 0.094 | -0.019 | 0.032 | 0.208 | 0.229 |
| month | -0.003 | 0.001 | -0.046 | 0.334 | 0.100 | -0.169 | -0.431 | 0.287 | -0.104 | 0.048 | 1.000 | 0.119 | -0.001 | 0.001 | 0.241 | 0.000 | 0.010 | 0.996 | 0.084 | 0.208 |
| rolling_avg_4weeks | -0.121 | -0.010 | -0.002 | 0.031 | 0.986 | 0.190 | 0.057 | 0.131 | 0.101 | 0.210 | 0.119 | 1.000 | 0.293 | -0.067 | -0.025 | 0.111 | 0.140 | 0.110 | 0.975 | 0.094 |
| size | -0.005 | 0.011 | 0.004 | 0.000 | 0.290 | 0.499 | 0.149 | 0.300 | 0.288 | 0.579 | -0.001 | 0.293 | 1.000 | -0.160 | -0.043 | 0.851 | -0.066 | -0.001 | 0.290 | 0.001 |
| store | -0.227 | 0.014 | 0.074 | 0.000 | -0.065 | -0.212 | 0.009 | -0.065 | -0.039 | -0.156 | 0.001 | -0.067 | -0.160 | 1.000 | -0.057 | 0.538 | 0.279 | 0.001 | -0.064 | 0.000 |
| temperature | 0.173 | 0.001 | 0.128 | 0.186 | -0.021 | 0.002 | -0.462 | -0.257 | 0.141 | -0.071 | 0.241 | -0.025 | -0.043 | -0.057 | 1.000 | 0.123 | 0.030 | 0.243 | -0.020 | 0.113 |
| type | 0.183 | 0.080 | 0.088 | 0.000 | 0.086 | 0.172 | 0.066 | 0.065 | 0.066 | 0.094 | 0.000 | 0.111 | 0.851 | 0.538 | 0.123 | 1.000 | 0.181 | 0.000 | 0.086 | 0.003 |
| unemployment | -0.391 | 0.005 | -0.060 | 0.054 | 0.145 | 0.064 | 0.060 | 0.043 | 0.038 | -0.019 | 0.010 | 0.140 | -0.066 | 0.279 | 0.030 | 0.181 | 1.000 | 0.005 | 0.149 | 0.272 |
| week | 0.001 | 0.001 | -0.035 | 0.384 | 0.092 | -0.189 | -0.434 | 0.274 | -0.132 | 0.032 | 0.996 | 0.110 | -0.001 | 0.001 | 0.243 | 0.000 | 0.005 | 1.000 | 0.076 | 0.198 |
| weekly_sales | -0.125 | -0.009 | 0.002 | 0.035 | 0.983 | 0.192 | 0.032 | 0.135 | 0.112 | 0.208 | 0.084 | 0.975 | 0.290 | -0.064 | -0.020 | 0.086 | 0.149 | 0.076 | 1.000 | 0.046 |
| year | 0.299 | 0.012 | 0.652 | 0.134 | 0.046 | 0.144 | 0.259 | 0.323 | 0.132 | 0.229 | 0.208 | 0.094 | 0.001 | 0.000 | 0.113 | 0.003 | 0.272 | 0.198 | 0.046 | 1.000 |
Missing values
Sample
| store | dept | date | weekly_sales | temperature | fuel_price | markdown1 | markdown2 | markdown3 | markdown4 | markdown5 | cpi | unemployment | type | size | isholiday | year | month | week | lag1_weekly_sales | rolling_avg_4weeks | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | 1 | 2010-02-05 | 24924.50 | 42.31 | 2.572 | NaN | NaN | NaN | NaN | NaN | 211.096358 | 8.106 | A | 151315.0 | False | 2010 | 2 | 5 | NaN | NaN |
| 1 | 1 | 2 | 2010-02-05 | 50605.27 | 42.31 | 2.572 | NaN | NaN | NaN | NaN | NaN | 211.096358 | 8.106 | A | 151315.0 | False | 2010 | 2 | 5 | NaN | NaN |
| 2 | 1 | 3 | 2010-02-05 | 13740.12 | 42.31 | 2.572 | NaN | NaN | NaN | NaN | NaN | 211.096358 | 8.106 | A | 151315.0 | False | 2010 | 2 | 5 | NaN | NaN |
| 3 | 1 | 4 | 2010-02-05 | 39954.04 | 42.31 | 2.572 | NaN | NaN | NaN | NaN | NaN | 211.096358 | 8.106 | A | 151315.0 | False | 2010 | 2 | 5 | NaN | NaN |
| 4 | 1 | 5 | 2010-02-05 | 32229.38 | 42.31 | 2.572 | NaN | NaN | NaN | NaN | NaN | 211.096358 | 8.106 | A | 151315.0 | False | 2010 | 2 | 5 | NaN | NaN |
| 5 | 1 | 6 | 2010-02-05 | 5749.03 | 42.31 | 2.572 | NaN | NaN | NaN | NaN | NaN | 211.096358 | 8.106 | A | 151315.0 | False | 2010 | 2 | 5 | NaN | NaN |
| 6 | 1 | 7 | 2010-02-05 | 21084.08 | 42.31 | 2.572 | NaN | NaN | NaN | NaN | NaN | 211.096358 | 8.106 | A | 151315.0 | False | 2010 | 2 | 5 | NaN | NaN |
| 7 | 1 | 8 | 2010-02-05 | 40129.01 | 42.31 | 2.572 | NaN | NaN | NaN | NaN | NaN | 211.096358 | 8.106 | A | 151315.0 | False | 2010 | 2 | 5 | NaN | NaN |
| 8 | 1 | 9 | 2010-02-05 | 16930.99 | 42.31 | 2.572 | NaN | NaN | NaN | NaN | NaN | 211.096358 | 8.106 | A | 151315.0 | False | 2010 | 2 | 5 | NaN | NaN |
| 9 | 1 | 10 | 2010-02-05 | 30721.50 | 42.31 | 2.572 | NaN | NaN | NaN | NaN | NaN | 211.096358 | 8.106 | A | 151315.0 | False | 2010 | 2 | 5 | NaN | NaN |
| store | dept | date | weekly_sales | temperature | fuel_price | markdown1 | markdown2 | markdown3 | markdown4 | markdown5 | cpi | unemployment | type | size | isholiday | year | month | week | lag1_weekly_sales | rolling_avg_4weeks | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 536624 | 45 | 85 | 2013-07-26 | 0.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 192.308899 | 8.667 | NaN | NaN | False | 2013 | 7 | 30 | 0.0 | 0.0 |
| 536625 | 45 | 87 | 2013-07-26 | 0.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 192.308899 | 8.667 | NaN | NaN | False | 2013 | 7 | 30 | 0.0 | 0.0 |
| 536626 | 45 | 90 | 2013-07-26 | 0.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 192.308899 | 8.667 | NaN | NaN | False | 2013 | 7 | 30 | 0.0 | 0.0 |
| 536627 | 45 | 91 | 2013-07-26 | 0.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 192.308899 | 8.667 | NaN | NaN | False | 2013 | 7 | 30 | 0.0 | 0.0 |
| 536628 | 45 | 92 | 2013-07-26 | 0.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 192.308899 | 8.667 | NaN | NaN | False | 2013 | 7 | 30 | 0.0 | 0.0 |
| 536629 | 45 | 93 | 2013-07-26 | 0.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 192.308899 | 8.667 | NaN | NaN | False | 2013 | 7 | 30 | 0.0 | 0.0 |
| 536630 | 45 | 94 | 2013-07-26 | 0.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 192.308899 | 8.667 | NaN | NaN | False | 2013 | 7 | 30 | 0.0 | 0.0 |
| 536631 | 45 | 95 | 2013-07-26 | 0.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 192.308899 | 8.667 | NaN | NaN | False | 2013 | 7 | 30 | 0.0 | 0.0 |
| 536632 | 45 | 97 | 2013-07-26 | 0.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 192.308899 | 8.667 | NaN | NaN | False | 2013 | 7 | 30 | 0.0 | 0.0 |
| 536633 | 45 | 98 | 2013-07-26 | 0.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 192.308899 | 8.667 | NaN | NaN | False | 2013 | 7 | 30 | 0.0 | 0.0 |